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1. Introduction 

In this paper we consider a mixed Poisson model for count data arising in Group Lifc insurance. 
We present a Bayesian formulation to determine the number of groups in an insurance portfolio 
consisting of claim numbers or deaths. We take a non-parametric Bayesian approach to modelling 
this mixture distribution using a Dirichlet process prior and use reversible jump Markov chain Monte 
Carlo to estimate the number of components in the mixture. The physical interpretation of the model 
in this case is that the heterogeneity is assumed to be drawn from one of a finite number of possible 
groups, in proportion which will be estimated. 

2. A Credibility Model for Heterogeneity 

The data arise from 1125 groups insured through the whole or parts of the period 1982-1985 by a 
major Norwegian insurance company. There are n = 12 classes distinguished by occupation category. 
T he i*^ class has r isk ex posure E j , and observed number of deaths, Di. The data are also analysed 



(j2000l ) and iNorbergl (jl989f ) . Let Z?i , . . . , Z?„ denote the number of observed deaths in 
each insured group. Associated with each group is the exposure, denote . . . ,£'„, respectively, 
which is a measure of the propensity of that group to produce claims/deaths. Let D" denote the 
collection of all deaths for each group, where 

Z)" = {-Di,...,-D„}. 

Similarly, let denote the collection of all exposures for the group 



_ {El, . . . , En}. 
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Figure [1] shows a plot of the claim number for each group while Figure [5] shows the claim numbers 
normalized by their corresponding exposures. 

The heterogeneity model is used to model differences in each of the n groups. For each group, 
the exposures are recorded and the resulting number of deaths or claims are then recorded for 
each group. At the first level, the number of claims for each group is assumed to follow a Poisson 
distribution with parameter XiEi, for i = 1, . . . , n. Thus 

Di ~ Voisson{\iEi), i — I, . . . ,n. 

We take a fully Bayesian approach and assume that the are IID and follow a Gamma distribution 
with parameters a and /3; that is, 

Xi ~ Qamma{a, P), 
where a and /3 are also assumed to be unknown. 

The advantage of using such mixed distributions is that it allows for extra variation in the number 
of occurrences since 

£(£>) = E(E(i:>|A)) = ¥.{E\) = Ea/P 

and 

Yar{D) ^¥.{Nar{D\\)) +Var{¥.{D\\)) 
^Ea/P + E'^a/P^ >£(£)). 



3. Extending the Basic IVIodel 



In an analysis based on the model described in Section!^ iHaastrud (|2000r) assume s that each 



grou 



has its own unique heterogeneity parameter A, drawn from some distribution. iHaastrud ()200i 
assumes that each class i has a unique heterogeneity parameter, denoted Ai, and that the number 
of deaths Di in this class follows a Poisson distribution with mean XiEi. The classes are assumed 
to be mutually independent, given the heterogeneity parameters Ai, A2, . . . , A„. Furthermore, he 
assumes that this distribution is identical for each group. In practice, large values of Ei will account 
for large values of Di, which will lead to similar values of A^ for each i. 

In our analysis, we propose a mixture model formulation. We assume that Di, given Aj, has a 
Poisson distribution with mean XjEi. We take a non-parametric Bayesian approach to modelling 
this mixture distribution using a Dirichlet process prior, and use reversible jump Markov chain Monte 
Carlo to estimate the number of components in the mixture. In this case, the physical interpretation 
of the model is that the heterogeneity is assumed to be drawn from one of k possible groups, in 
proportions Wi, . . . , Wk- 
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Fig. 1. Plot of the number of observed claims for each Group. 
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Fig. 2. Plot of the number of observed claims per unit exposure. 



4. Mixture Formulation 



The method we describe is essentially a classification problem where we assume that each observed 
Di comes from any one of k components, where each component has a Poisson distribution. Thus 



Di\\j Voisson(EiXj) j 



1, 



.k\i — 1, 



More general forms of the mixture Poisson model with covariates are discussed in 



1999 



( 2002 ) . Mix tu re models for grouped clai m numbers are considered bv lTremblavl (|l992l ) and 



while 



2000). 



Viallefont et al 



Green and Richardson 



Walhin and Paris 



Dellaportas et al.l (jl997l ) considers count data in Finance using split/merge moves, 



(|2002[ ) provides a more general discussion of mixtures of Poisson distribu- 
tions, using both split /merge moves and birth/de ath moves. Other methods for determining the 
numb er of components in a mix ture a re discussed by McLachlan and Peel ([20001) , iPhillips and Smith 
(|l996l ). ICarlin and ChibI (|l995h . and IStephend (|2000l ) who use Markov chains to model jointly the 
number of components and component values. The advantage of the Bayesian formulation is that 
we can place posterior probabilities on the order of the model. 



4. 1. The Likelihood Function 



Throughout our discussion, n will denote the number of data points and k will denote the number of 
components in the mixture formulation. For a finite mixture model, the observed likelihood function 
is 



i=i j=i 



(1) 



where the weights are non- negative and X^i^i — ^- Even for moderate values of n ar id k, this takes 



a lon g time to evaluate since there are k" terms when the inner sums are expanded (jCasella et al 



20001 ). Another form of the likelihood functio n will be derived shortly. Clas sical estimation pro- 



cedure s for mixture models are described by 
(I2OO0I). 



Titterington et al. 



(|l990l) and 



McLachlan and Peel 



Let Zi be a categorical random variable taking values in {1, 
respectively, so that 

p{Zi = j\w) = Wj. 



. , fc} with probabilities wi. 



Suppose that the conditional distribution of 1?^, given Zi = j, is Poisson{\j), j = l,...,fc. Let 
/j (-) denote a Poisson density with parameter Aj. Then the unconditional density of Di is given by 
/(A), where 



k 

/(A)=5]«^,/,(A|A,£;"), 
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since 

k 

/(A) =^/,(A|2. A,i?")p(z, -j) (2) 

3 = 1 
k 

^Y.w,f,{D,\\jE{). 

= 1 

With every pair {D,i,Ei), we associate a latent variable z^, which is an indicator variable that 
indicates which component of the mixture is associated with {Di,Ei). We have, Zi — j if the i*'' 
data point, (A, Ei), comes from the j*'' component of the mixture. Thus, for each i, we have 

Zi\w M{l;wi, . . .,Wk) 

and 

By incorporating the indicator variables the complete data likelihood is then 



L(D"|z,A,£;") = n/(A|A,,,A) 



1=1 

k 



=n n /(A I A,, A). (3) 

j=l {i:Zi=i] 

At times, especially for the fixed k case described below, it is more convenient to work with ([3]) as 
it involves multiplications only, rather than additions and multiplications, as in ([ij. 

The convenience of using the missing data formulation is that the posterior conditional distri- 
bution of the model parameters would be standard distributions. Also the augmented variables z 
allows us to see what component of the mixture the data points are assigned. 



4.2. Gibbs Updates for Fixed h: 

We consider a mixture of Poissons where, conditional on there being fc components in the mixture, 
we have 

k 

A— ^^«,/(-IA,,A). 
The weights Wj sum to one, and are non-negative, so that 

k 

^ = 1, and Wj > 0. (4) 

i=i 

/(A|A, Zi = j, Ei) Poisson(Aj A) with P{zi = j) = Wj 

and 



w ~ 'D{6i, ... ,4) 
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follows a Diriclilet distribution. We also make the additional assumption that the (5j's are equal to 
1 so that p{w) is a uniform distribution on the space described by (|4]). For the Poisson parameters 
Xj, we take Gamma priors, so that 

Xj ~ Gamma(a, b) j = 1, . . . , fc 

with the ordering constraint 

Ai < As < ... < Afc, (5) 

to ensure that the components are identifiable. The ordering constraint is not necessary for the Monte 
Carlo algorithm to work. However it does avoid the problem of label switching, since otherwise, any 
permutation of the indices {1, . . . , fc} will result in the same posterior distribution. 



Note that because we have the ordering constraint in Equation ([S]), the joint density of the 
collective A is 

p{X\a,l3,k) = klp{Xi\a,(3) ■ ■ •p(Afe|Q;, /3)/ai<a2<...<aJA). 

When k is fixed and known, the factorial term fc! does not affect the MCMC algorithm since it can 
be absorbed into the normalising constant. However, in the variable k case, it must be noted, since 
it is a factor in the reversible jump acceptance probability. The joint density of all unknowns is 

tt{w,X,z\D'') (xp{w\S)p{z\w)p{X\a,f3)L{D''\X,z,E"). (6) 

With the missing data formulation, the likelihood term L{D"\X, z, E"^) can be written 



L{D"\X,z,E)^Y[ 



1=1 

and 



A! 



p(^z\w) = n^?' 

i=i 

where Uj — ^{i\zi — j}, is the number of observations allocated to component j. The prior 
distributions are ^ 

p(A.|a,/3) = -|^Arie-''^'. 
r(a) 



Using Bayes' theorem, we have the following posterior conditional distributions: 
^(A,) « A^'e-^^^- X Ap '--'''e-'^-^'i^.-^'/(,^_,,,^^,)(A,) 



and 



7r(zi;|(5, z) p(w\S)p{z\w) 
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so that, 



w ~ 'D{Si + ni, . . . ,Sk + rik), 



where Uj = i^{i\zi — j}. For z, we update the aUocations using 



P{zi^ j) ccwjf{Di\\j,Ei) i = l,...,n; j = l,...,fc, 



so that, 



P{zt = j) 



w,f{D,\Xj,Ei) 
T.Uw,f{D.,\\,,E.) 



(7) 



This fohows from Equation The Gibbs algorithm for fixed k is then (|Robert and Casellal . 119991 ) 
Step 1: Simulate Zi from 

p{z., = j) (X ■Wjf{Di\\j,Ei) for j = 1, . . . , fc 
and compute rij, rijDj, njEj from 



Step 2: Simulate 



Step 3: Simulate 



Xj ^ Qamma{a + ?ij-Dj, /3 + nj£'j)I(Aj_i_Aj+i)(Aj) for j = 1, . . . , fc 



w ^ 2?(5 + ni, . . . , (5 + rife). 



4.3. Reversible Jump MCMC 

The Reversible jump algorithm is an extension of the Metropolis-Hastings algorithm. We assume 
there is a countable collection of candidate models, indexed by M <E A4 = {Mi, Al2,. ■ ■ , Mk}. We 
further assume that for each model Mi, there exists an unknown parameter vector 9i G M"* where 
rii, the dimension of the parameter vector, can vary with i. Typically we are interested in finding 
which models have the greatest posterior probabilities and also estimates of the parameters. Thus 
the unknowns in this modelling scenario will include the model index Mi as well as the parameter 
vector 6i. We assume that the models and corresponding parameter vectors have a joint density 
T:{Mi,6i). The reversible jump algorithm constructs a rever sible Markov chain on the state space 
A4 X Ua/ eAi which has tt as its stationary distribution (|Greenl . Il995f) . In many instances, and 
in particular for Bayesian problems this joint distribution is of the form 

TTiM,,9,) = TTiM,,9,\X) oc LiX\M„ 6,) p{M„ 0i), 

where the prior on [Mi, 6i) is often of the form 



p{M,,ei)^p{e,\Mi)p{M,) 
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with p{Mi) being the density of some counting distribution. 



Suppose now that we are at model Mi and a move to model Alj is proposed with probability . 
The corresponding move from 6i to 6j is achieved by using a deterministic transformation hij , such 
that 

(6>j,-i;) = /i„-(6>„m), (8) 

where u and v are random variables introduced to ensure dimension matching necessary for re- 
versibility. To ensure dimension matching we must have 



dim{9j) + dim(t)) ~ dim{6i) + dim(M). 



Brooks et al. 



For discussions abo ut possible choices for the function hij we refer the reader to iGreenI (jl995r ). and 
( 2003 ). Let 



tt{Mj,9j) q{v) Tji 



dhij{ei,u) 



d{e,,u) 



(9) 



TT{Mi,ei) q{u) Tij 

then the acceptance probability for a proposed move from model (Af^, 6i) to model (Afj, 6j) is 

min{l,A(0„6>,)} 

where q{u) and q{v) are the respective proposal den sities for u a nd t), and \dhij{6i,u) / d{6i,u)\ 
is the Jacobian of the transformation induced by hij . iGreenI (|l995h shows that the algorithm with 
acceptance probability given above simulates a Markov chain which is reversible and follows from 
the detailed balance equation 



n{Mi,ei)q{u)rij = 'K{Mj,e j)q{v)rji 



dhij{ei,u) 



Detailed balance is necessary to ensure reversibility and is a sufficient condition for the existence of 
a unique stationary distribution. For the reverse move from model Mj to model Af; it is easy to see 
that the transformation used is {Oi,u) = hYj^{9j,v) and the acceptance probability for such a move 
is 



min < 1 



7r(Aft,g») q{u) Tjj 
' T^{Mj,9j) q{v) Tj, 



dhij{Bi,u) 



min{l,A(0„0j)-i}. 



For inference regarding which model has the greater posterior probability we can base our analysis 
on a realisation of the Markov chain constructed above. The marginal posterior probability of model 

p{M,)f(X\Nh) 



n{M,\X) 



EM,eMPiM,)f{X\M,)' 



where 



f{x\AU)^ J L{x\M,,ei)p{e,\M,)de, 



is the marginal density of the data after integrating over the unknown parameters 6. In practice we 
estimate 'n{Mi\X) by counting the number of times the Markov chain visits model Mi in a single 
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long run after reaching stationarity. These between model moves described in this section are also 
augmented with within model Gibbs updates as given in Section [4.21 to update model parameters. 



5. Reversible Jump Model Selection 

To update the model order and thereby increase or decrease the number of components in the 
mixture, we use a combination of birth/death and split/merge moves as described below. We 
assume a uniform prior on the number of components fc, so that 

where k„iax is chosen to allow the algorithm to explore all feasible models. We set k„iax — 72, 
the number of groups, as under our hypothesis, this is the maximum number of components in the 
mixture, k — kmax only when the groups are all distinct. Setting kmax — 72 will allow for direct 
comparison of the empirical Bayesian and the mixture model approach. 

Introducing a prior on the number of components k, we extend the joint density © of all 
parameters. Thus, now 



TT{k, w, z, A|D") cx p{k)p{w\S, k)p{X\a, (3, k)p{z\k)L{D"\\, z). 



(10) 



Note that the densities of the other model parameters now depend on k. In Sections 15.11 and 15. 3[ 
we describe in detail two algorithms which are used to simulate from this density. These algo- 
rithms are then combined with the fixed k updates of Section 14.21 to simulate from the density 
in Equation (IIOD. Modelhng mi xtures with and without the Dirichlet process prior is considered 
by iGreen and RichardsonI (|200lh . who also considers the case of an unknown number of compo- 
nents. Alternatives to the r eversible jump algorithm in this context do exist, see for example 
Dellaportas and KarlisI (|200l[ ). who develop a semi-parametric sample based method to approxi- 
mate a mixing density g{9) based on the method of moments. 



5. 1 . Split and Merge Moves 

N ote that the joint density in Equation (1101) now dep ends on k. We use the split/merge method 
of 



Dellaportas et al.1 (|l997l ) and 



Viallefont et al. 



components, let 



(|2002l) . Suppose we are at a configuration with k 
Sk = {(Ai,u)i), . . . , (Afc,Wfc)}, 



and suppose a move to increase the number of components is proposed. We select uniformly one of 
the current k components to be split. Suppose the component, (Aj,Wj), is selected to be split 
into two components (Ajj,Wjj) and (AjjjUijj) such that ji = j and j2 = .7 + 1, the components 
originally numbered j + I, . . . , k are then renumbered j + 2,...,fc + l. The split is also designed 
so that the first two moments of the split component remains the same as the original component. 
Thus, we simulate ui and U2 from densities defined on the interval [0,1]. Usually, we use Beta 
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densities and set 



Wj-^ = WjUi, 

— Wj{l — Ml), 

Aji — XjU2, 

^32 = Aj(l -UiU2)/(l -Ml). 

Other choices for splitting and merging components are described in 
proposed parameter is then 



Viallefont et al. 



pOOa ). The 



6k+i = {{Xi,wi),.. . , (Aj_i,Wj_i), {Xj,,WjJ, {Xj2,Wj^), {Xj+i,Wj+i), {Xk,Wk)}. 



If the ordering constraint in Equation ([S]) is not satisfied then the move is rejected immediately, 
as the reverse move in which we merge two adjacent components would not be possible. We can 
compute the Jacobian for this transformation as 



de 



fc+1 



d{0k,ui,u2) 



d{wj^,Wj^,Xj^,Xj^) 



d{wj,Xj,ui,u2) 



h'^3 

1 — Ul 



(11) 



For the reverse move, we select a pair of adjacent components ji and i2- Combining them, to 
get a new component labelled j, we set 



Wj=wj^+Wj,„ Xj 



Wj-^ Ajj + Aj 



by keeping the first two moments of the proposed and current configuration constant. We then 
sample a new set of allocation variables according to Equation ([7]). We also keep track of the 
probability of each allocation, so that Pa{z) represents the probability of a given allocation. To 
compute Pa{z), we first simulate Zi using Equation ([7]). For each i, the probability of that allocation 
is given by 

w,JiD,\X,^,E,) 



Pa{zi) 



j:U^3fiD^\X,,E, 



Finally, we compute the probability of all allocations by 

n 

Pa{z) = Y[pa{Zi). 



5.2. Acceptance Probability 

The acceptance probability of a move of type (fc, 6}~) =>- (fe', 6^') is then min{l, A^^k'}, where 



A 



7T{k',ek') p{k'^k) 1 

k k' = — ^ X ^ X 

n{k,ek) p(k^k') q{ui)q{u2) 



dOk' 



d{6k,ui,u2) 
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p{k')p{w'\S, k')p{X'\a, /3, fcO■L(g"|A^ z') p(z>^ fc + l)/p,(z^ 

Ak^k' — — — : — — — ^ ■ , - , „ , . ^ X — — X 



p{k)p{w\d, k)p{X\a, l3, k)L{D" \X, z) 



p{z\w,k)/pa{z) 

p{k' =^ k) 1 



p(k =^ k') q{ui)q{u2) d{6k,ui, U2) 



With k' = k + 1 this becomes 



_ p{k + 1) p{w'\5,k + l) p(y|a,/?,fc + l) Xp"|V,wO 
^'^+^^ p[k) p{w\S,k) p{\\a,l3,k) X(D"|A,u;) ^ 

p{k + l^ k) 



1 



dek+i 



p{k^k + l) q{ui)q{u2) d{6k,ui,U2) 
Now with a uniform prior on the number of components k and the weights 

r(fc + l) (fc + l)p(A,-, \a,l3)p{\, , |a, /3) p(z>^ fc + 1)/Pa(z0 

k)/pa{z) 

dOk+i 



A 



k,k+l 



m 



p{Xj\a,l3) 



L{D''\\',z') TOfe+i 

X X 



1 



L{D'^\\,z) Sk q{ui)q{u2) d{6k,Ui,U2) 
where the ratio of Gamma terms comes from the ratio of the prior distributions on w' and w and 

p{k + l^ k) _ nik+i/{k + 1-1) _ nik+i 



p{k^ k + 1) 



Sk/k 



Sk 



5.3. Birth and Deatti Moves 

Suppose we are now at model Mk with k components, say 

ek^{iXi,w,),...AXk,wk)}. (12) 

If a move is proposed to increase the number of components by one, then we simulate 

w ~ Beta{l, fc) and A ~ Qamma{a, b), 

independently. The proposed new component will then have weight w and the other weights are then 
scaled by a factor of {1 — w), so that the sum of the weights remain 1. The corresponding Poisson 
parameter for the proposed component in A. Note that A is sampled from its prior distribution. The 
proposed component is then 

Ok+i = {Xi,wi/{1 - w)), . . . ,{Xk,Wk/{l - w)),{X,w)}. (13) 

Using this proposed value of 6^+1, we also simulate proposed values for the allocations z' with model 
k + 1. Using the general form of the reversible jump acceptance probability, see for example 



Green 



(|1995I )). the probability of changing the number of components to fc + 1 is then min{l, ^fc,fc+i}, 
where 

dOk+i 



^ _ 7r(fc + l,6>fc+i) ^ p(fc + l^fc) ^ 1 

TT{k,ek) Pik ^ k + I) q(w)qCX) 



d{9k,w,X) 
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Making the necessary substitutions yield 



_ p{k + l)p{w'\S, k + l)p{X'\a, (3, k + 1)L{D'^\X' , z') 
'''''^^ ^ p{k)p{w\S, k)p{X\a, 13, k)L{D''\X, z) ^ 

p{z'\w',k + l)/paiz') ^ p{k + l^ k) ^ 



de 



fe+i 



p{z\w,k + l)lpa{z) p{k^k + l) q{w)q{X) 9(6»fc,w,A) 
Using Equations (fT2|) and (fT3|) we then have the Jacobian 



(14) 



de 



k+i 



r.^fc-l 



d{ek,w, X) 

If we denote the probabihty of a birth when there are k components by bk , and the probabihty of a 
death by dk, with bk + dk = 1, then 

pik + l^k) _ dk+i/{k + l) 



p{k 



1 



bk 



since for the move to be reversible we would then be able to kill k + 1 components in the new model, 
each with equal probability. Substituting these values in Equation (jl4l) . the ratio Ak,k+i reduces to 



r(fc + i) „^ , L{n"\x\z') p{z'\w' k + i)/paiz') ^^ 

Ak,k+1 = ^ X (fc + l)p(A) X ,. ^ X — r^^ — X 



m 



L{D^\X,z) 



p{z\w,k)/pa{z) 

dk+i/ik + 1) 1 



q{w)q{X) 



X (1 - w) 



r.\fc-l 



which on substituting q{X) = p{X) and q{w) — k{l — w) ^ further reduces to 



A 



k,k+l 



p{z'\w\k + l)/pa{z')LiD"\X'.,z') d, 



k+1 



(15) 



p{z\w,k)/pa{z) L{D^\X,z) bk 
For a proposed death move, the acceptance probability is then 

min{l,^fcl+i}- 

Even though the algorithm simulates new values of for the allocations when proposing to move, it 
is not necessary to carry the allocations along. For between-model moves, we could replace the 
missing data formulation by noting that 



fc) 

Pa{z) 



i(i:)"|A,z) = X(D"|A,w). 



Thus we could updat e model parameters using a scheme which does not require conjugacy; see for 



example ICappe et al 



(2003 



33). 



6. Results 

We now present some numerical results for this dataset based on the model described in Section!?] 
and using the algorithms described in Section [S] 
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Table 1. Posterior Model Order. 



Model Order 


Posterior Probability 


k 


7r(fc|I>",S") 


1 


0.00000 


2 


0.59485 


3 


0.29058 


4 


0.08588 


5 


0.02258 


6 


0.00448 


7 


0.00104 


8 


0.00034 


9 


0.00026 


10 


0.00000 



Table 2. Acceptance Rates. 



Scheme 


Acceptance Rate 


Birth/Death 


0.077 


Split/Merge 


0.055 


Birth/Death and Split /Merge 


0.066 



Table [T] shows the posterior model probabilities calculated from the reversible jump algorithm by 
counting the proportion of ties the algorithm visits each model. A plot of the number of components 
as the chain evolves is shown in Figure 3(a) The results show clearly that the number of components 
has a posterior mode at k — 2. Also, the model with k = 1 component is never visited. If the 
algorithm is started with k = 1 then immediately it jumps to fc = 2 and never returns to fc = 1. Since 
more than 88% of the posterior probability mass is placed on the models with 2 or 3 components, 
we discuss those models in detail in Section [Ql The between- model acceptance rates were 7.7% 
and 5.5% for the birth/death and split/merge moves, respectively. The total acceptance rate when 
there is equal probability of proposing a birth/death move or a split/merge move, is 6.6%. These 
results are tabulated in Table [2] 

To assess convergence of the algorithm, we simulated 4 chains using different starting values and 
different random number seeds for a total of 100000 iterations. Both the x-square and Kolmogorov- 
Smirnov diagnostics are computed. These diagnostics are plotted in Figure S) 



6. 1. Comparing ttie Model Move Schemes 

A comparison of the individual acceptance probabilities shows that the between model moves are 
accepted with a larger rate for the birth death scheme compared with the spli t merge scheme . 



This might not always be true, as other split merge schemes may be proposed (jViallefont et al 



2OO2I ). It is interesting to note that although the birth and death rates are higher than the split 



and merge rates, the combined scheme seems to mix better than either scheme implemented alone. 
Even though the birth and death scheme have a higher acceptance rate for between-model moves, 
the excursions away from the values of highest posterior density, k — 2 and fc = 3, are longer than 
for the combined scheme or the split and merge scheme since. This is because when proposing 
parameters independently from the prior, areas of low probability can be proposed, whereas, with 
the split and merge scheme, areas of low probability mass will generally be rejected. Based on the 
results presented here, the birth/death method would be the preferred algorithm. 
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(a) Birth/death and split/merge model trace. 
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(b) Birth/death and split /merge histogram. 
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(d) Birth/death histogram. 
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(e) Split/merge model trace. (f) Split/merge histogram. 

Fig. 3. Left: Model trace indicator. Right: Histogram of posterior model order. 
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Table 3. Parameters Estimates Condi- 
tional on fc = 2. 





Estimate 


95% HPD Interval 


Ai 


0.731 


(0.626, 0.839) 


Wi 


0.636 


(0.428, 0.821) 


Aa 


1.896 


(1.557, 2.249) 


W2 


0.363 


(0.178, 0.571) 


Table 4. Parameters Estimates Condi- 


tional on fc = 3. 






Estimate 


95% HPD Interval 


Ai 


0.462 


(0, 0.770) 


Wl 


0.299 


(0.000, 0.656) 


Aa 


1.115 


(0.625, 1.692) 


W2 


0.495 


(0.157, 0.797) 


As 


2.481 


(1.570, 3.366) 


W3 


0.204 


(0.002, 0.464) 



6.2. Detailed Results fork:=^2 and fc 3 

A histogram plot of the model indicator from the reversible jump algorithm of Section [S] shows 
that the most plausible model generating the claims in the portfolio is a mixture of two Poisson 
distributions. In this section we look further at the results, conditional on their being only two 
components, or three components, in the mixture. 

Recall the missing data formulation introduced in Section 2] for the number of components con- 
ditional on fc = 2, we observed the posterior distribution of z at each iteration when k = 2. A study 
of values of z will tell us how the data has been allocated to the components and therefore, which 
data points have been generated from either the first Poisson distribution or the second Poisson dis- 
tribution. This information, along with further information from the portfolio, will help insurance 
companies classify groups of life insurance portfolios. The parameter estimates are shown in Table [3] 

Similar results for the posterior parameter estimates, conditional on there being three components 
in the mixture, are given in Tabled 

Figure [S] show the posterior probability of each data point being allocated to a particular com- 
ponent of the mixture, conditional on fc = 2 and conditional fc = 3, respectively. 



7. Summary 

We present a model for heterogeneity in group life insurance. We show that the assumption of 
identical heterogeneity for all groups under consideration, may not necessarily hold. In this case, 
it is necessary to put similar groups together for further analysis. We employ a non-parametric 
approach any apply reversible jump methods to determine the number of compon ents in the mixture . 
An extensi on of the current work to the case where claims are grouped, such as (jWalhin and Paris , 



1999 



20001 1 ■ would therefore be appropriate. 
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Fig. 4. Convergence diagnostics. 




Fig. 5. Probability (vertical axis) of data from Group i (horizontal axis) being assigned to individual components 
conditional on = 2 (left) and fc = 3 (right). 
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